Accounting for bias from sequencing error in population genetic estimates.
Authors
Abstract
Sequencing error presents a significant challenge to population genetic analyses using low-coverage sequence in general and single-pass reads in particular. Bias in parameter estimates becomes severe when the level of polymorphism (signal) is low relative to the amount of error (noise). Choosing an arbitrary quality score cutoff yields biased estimates, particularly with newer, non-Sanger sequencing technologies that have different quality score distributions. We propose a rule of thumb to judge when a given threshold will lead to significant bias and suggest alternative approaches that reduce bias.
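The signal-versus-noise argument in the abstract can be illustrated with a short sketch. It is not the authors' rule of thumb, which the abstract does not state. It assumes the standard Phred definition Q = -10 log10(p) and a first-order approximation in which two single-pass reads with per-base error rate e differ at a site with probability of about pi + 2e, where pi is the true pairwise diversity. The function names and the numbers in the usage note are hypothetical.

```python
def phred_to_error_prob(q):
    """Convert a Phred quality score to a per-base error probability."""
    return 10 ** (-q / 10)


def mean_error_above_cutoff(quals, cutoff):
    """Mean error probability of the bases retained at a quality cutoff."""
    kept = [phred_to_error_prob(q) for q in quals if q >= cutoff]
    if not kept:
        raise ValueError("no bases pass the quality cutoff")
    return sum(kept) / len(kept)


def relative_bias_pairwise(pi_true, mean_err):
    """Approximate relative inflation of pairwise diversity.

    Assumption (first order): each of the two reads contributes errors
    independently, so the apparent diversity is about pi_true + 2 * mean_err.
    """
    if pi_true <= 0:
        raise ValueError("pi_true must be positive")
    return 2 * mean_err / pi_true
```

Under these assumptions, a cutoff of Q20 admits bases with an error probability of up to 0.01. If the true diversity were 0.001 per site and the retained bases averaged that error rate, `relative_bias_pairwise(0.001, 0.01)` would give an inflation of about twentyfold, which is the low-signal, high-noise regime the abstract warns about.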
Related articles
Characterizing bias in population genetic inferences from low-coverage sequencing data.
The site frequency spectrum (SFS) is of primary interest in population genetic studies, because the SFS compresses variation data into a simple summary from which many population genetic inferences can proceed. However, inferring the SFS from sequencing data is challenging because genotype calls from sequencing data are often inaccurate due to high error rates and if not accounted for, this gen...
The Predictability Power of Neural Network and Genetic Algorithm from Firms’ Financial crisis
Organizations are increasingly exposed to financial risk that can lead to bankruptcy and loss of business. This may lead to discontinuity in operations, increased legal fees, administrative costs, and other indirect costs. Accordingly, the purpose of this study was to predict financial crisis on the Tehran Stock Exchange using a neural network and a genetic algorithm. This research is descripti...
Two-stage study designs combining genome-wide association studies, tag single-nucleotide polymorphisms, and exome sequencing: accuracy of genetic effect estimates
Genome-wide association studies (GWAS) test for disease-trait associations and estimate effect sizes at tag single-nucleotide polymorphisms (SNPs), which imperfectly capture variation at causal SNPs. Sequencing studies can examine potential causal SNPs directly; however, sequencing the whole genome or exome can be prohibitively expensive. Costs can be limited by using a GWAS to detect the assoc...
Providing A Model for Management Earnings Forecast Bias
Despite the important role that management profit forecasting plays in the decision making of capital market actors, these predictions appear to be biased. In the attempt to measure the bias of predicting profit management, numerous one-dimensional measurement tools have been proposed in the accounting and finance literature. Despite these efforts, no comprehensive composite index has been dev...
A novel approach to estimating heterozygosity from low-coverage genome sequence An Investigation Submitted to Genetics
High-throughput shotgun sequence data makes it possible in principle to accurately estimate population genetic parameters without confounding by SNP ascertainment bias. One such statistic of interest is the proportion of heterozygous sites within an individual’s genome, which is informative about inbreeding and effective population size. However, in many cases, the available sequence data of an...
Journal:
- Molecular Biology and Evolution
Volume 25, Issue 1
Pages: -
Publication date: 2008